SUBMITTED TO THE IEEE TRANSACTIONS ON COMMUNICATIONS 



1 



(N 



\0' 



X: 



Per-antenna Constant Envelope Precoding for 
Large Multi-User MIMO Systems 

Saif Khan Mohammed* and Erik G. Larsson 

Abstract 



• We consider the multi-user MIMO broadcast channel with M single-antenna users and N transmit antennas 

under the constraint that each antenna emits signals having constant envelope (CE). The motivation for this is 



that CE signals facilitate the use of power-efficient RF power amplifiers. Analytical and numerical results show 
that, under certain mild conditions on the channel gains, for a fixed M, array gain is achievable even under the 
stringent per-antenna CE constraint (essentially, for a fixed M, at sufficiently large N the total transmitted power 
can be reduced with increasing N while maintaining a fixed information rate to each user). Simulations for the 
Q , i.i.d. Rayleigh fading channel show that the total transmit power can be reduced linearly with increasing N (i.e., an 



0{N) array gain). We also propose a precoding scheme which finds near-optimal CE signals to be transmitted, and 



^ . has 0{MN) complexity. Also, in terms of the total transmit power required to achieve a fixed desired information 

'NT 



sum-rate, despite the stringent per-antenna CE constraint, the proposed CE precoding scheme performs close to 
the sum-capacity achieving scheme for an average-only total transmit power constrained channel. 



. Index Terms 

(N 



Multi-user, constant envelope, per-antenna. Large MIMO, GBC. 

I I. Introduction 

We consider a Gaussian Broadcast Cliannel (GBC), wherein a base station (BS) having antennas 
communicates with M single- antenna users in the downlink. Large antenna arrays at the BS has been of 
recent interest, due to their remarkable ability to suppress multi-user interference (MUI) with very simple 
precoding techniques [HI. Specifically, under an average only total transmit power constraint (APC), for a 
fixed M, a simple matched-filter precoder has been shown to achieve total MUI suppression in the limit 

The authors are with the Communication Systems Division, Dept. of Electrical Engineering (ISY), Linkoping University, Linkoping, 
Sweden. This work was supported by the Swedish Foundation for Strategic Research (SSF) and ELLIIT. E. G. Larsson is a Royal Swedish 
Academy of Sciences (KVA) Research Fellow supported by a grant from the Knut and Alice Wallenberg Foundation. Parts of the results in 
this paper were presented at IEEE ICASSP 2012 IT?! . Also, the simpler special case of Af = 1 (i.e., single-user) has been studied by us in 
much greater detail in 1161 . 
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as — )■ cxo [|2l . Additionally, due to the inherent array power gain propertjo large antenna arrays are also 
being considered as an enabler for reducing power consumption in wireless communications, especially 
since the operational power consumption at BS is becoming a matter of world-wide concern H, [[5]|. 



ly building them would require cheap and 
J With current technology, power-efficient 



Despite the benefits of large antenna arrays at BS, practica 
power-efficient RF components like the power amplifier (PA) 
RF components are generally non-linear. The type of transmitted signal that facilitates the use of most 
power-efficient/non-linear RF components, is a constant envelope (CE) signal. In this paper, we therefore 
consider a GBC, where the signal transmitted from each BS antenna has a constant amplitude for every 
channel-use and which is independent of the channel realization^ 

Since, the per-antenna CE constraint is much more restrictive than APC, we investigate as to whether 
MUI suppression and array power gain can still be achieved under the stringent per-antenna CE constraint? 

To the best of our knowledge, there is no reported work which addresses this question. Most reported 
work on per-antenna communication consider an average-only or a peak-only power constraint (see jU, 
[|7l and references therein). In this paper, firstly, we derive expressions for the MUI at each user under the 
per-antenna CE constraint, and then propose a low-complexity CE precoding scheme with the objective of 
minimizing the MUI energy at each user. For a given vector of information symbols to be communicated 
to the users, the proposed precoding scheme chooses per-antenna CE transmit signals in such a way that 



^ Under an APC constraint, for a fixed Af and a fixed desired information sum-rate, the required total transmit power decreases witli 
increasing A'^ ||3]. 

^In conventional BS, power-inefficient PA's contribute to roughly 40-50 percent of the total operational power consumption (5). 
In this paper, we only consider the discrete-time complex baseband equivalent channel model, where we aim to restrict the discrete-time 

per-antenna channel input to have no amplitude variations. Compared to precoding methods which result in large amplitude-variations in the 

discrete-time channel input, the CE precoding method proposed in this paper is expected to result in continuous-time transmit signals which 

have a significantly improved peak-to-average-power-ratio (PAPR). However, this does not necessarily mean that the proposed precoding 

method will result in continuous-time transmit signals having a perfectly constant envelope. Generation of perfectly constant envelope 

continuous-time transmit signals has not been covered in this paper, and constitutes future work for us. One possible method to generate 

almost constant-envelope continuous-time signals could be that, in addition to constraining the discrete-time channel input to have no 

amplitude variations, one could also consider constraining the phase variation between consecutive symbols of the discrete-time channel 

input. 
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the MUI energy at each user is smalllfl 

Secondly, under certain mild channel conditions (including i.i.d. fading), using a novel probabilistic 
approach, we analytically show that, MUI suppression can be achieved even under the stringent per- 
antenna CE constraint. Specifically, for a fixed M and fixed user information symbol alphabets, an 
arbitrarily low MUI energy can be guaranteed at each user, by choosing a sufficiently large A^. Our 
analysis further reveals that, with a fixed M and increasing N, the total transmitted power can be reduced 
while maintaining a constant signal-to-interference-and-noise-ratio (SINR) level at each user. 

Thirdly, through simulation, we confirm our analytical observations for the i.i.d. Rayleigh fading channel. 
For the proposed CE precoder, we numerically compute an achievable ergodic information sum-rate, and 
observe that, for a fixed M and a fixed desired ergodic sum-rate, the required total transmit power reduces 
linearly with increasing N (i.e., achievability of an 0{N) array power gain under the per-antenna CE 
constraint). We also observe that, to achieve a given desired ergodic information sum-rate, compared to 
the optimal GBC sum-capacity achieving scheme under APC, the extra total transmit power required by 
the proposed CE precoding scheme is small (roughly 2.0 dB for sufficiently large A^). 

Notation: C and R denote the set of complex and real numbers. \x\, x* and arg(x) denote the absolute 
value, complex conjugate and argument of x G C respectively. ||hp = XlJ^iP denotes the squared 
Euclidean-norm of h = (/ii, ■ ■ ■ , /ijy) G C^. E[-] denotes the expectation operator. Abbreviations: r.v. 
(random variable), bpcu (bits-per-channel-use), p.d.f. (probability density function). 



II. System Model 

Let the complex channel gain between the i-th BS antenna and the fc-th user be denoted by /i^ j. The 
vector of channel gains from the BS antennas to the /c-th user is denoted by h^. = (/ia:,i, /ifc,2, ■ ■ ■ , hk^NY ■ 
H G C^^^^ is the channel gain matrix with hk^i as its (A;,z)-th entry. Let Xj denote the complex symbol 
transmitted from the z-th BS antenna. Further, let Pt denote the average total power transmitted from 
all the BS antennas. Under APC, we must have lE[^^i = Pt, whereas under the per-antenna CE 

'Here "small" implies that the MUI energy is of the same order or less than the variance of the additive white Gaussian noise (AWGN) 
at the receiver. Throughout the paper, we assume that such large antenna systems will operate in a regime where the information rate 
performance is not critically limited by MUI. This is because, it is highly power-inefficient to operate in a regime where the MUI energy is 
significantly more than the AWGN variance (8). 
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constraint we have = Pt/N , i = 1,2, - ■ ■ , N which is clearly a more stringent constraint compared 



to APC. Further, due to the per-antenna CE constraint, it is clear that x-i is of the form Xi = Pr/Ne^^^ 
where 9i is the phase of xiU Under CE transmission, the symbol received by the k-th user is therefore 
given by 

fp- N 

y^ = y^J2 , k = i,2,...,M (1) 

1=1 

where Wk ~ CM{0, cr^) is the AWGN noise at the k-th receiver. For the sake of notation, let 6 = 
(^1, ■ ■ ■ ,6n)'^ denote the vector of transmitted phase angles. Let u = {^/E^Ul, ■ ■ ■ , \/EmUmY be the 
vector of scaled information symbols, with Uk E Uk denoting the information symbol to be communicated 
to the fc-th user. Here Uk denotes the unit average energy information alphabet of the A;-th user. Ek,k = 
1, 2, . . . , M denotes the information symbol energy for each user. Also, let U = y/E^Ui X v^Ws X ■ ■ ■ X 



^JEmUm- Subsequently, in this paper, we are interested in scenarios where M is fixed and N is allowed 
to increase. Also, throughout this paper, for a fixed M, the alphabets Wi, ■ ■ ■ Mm are also fixed and do 
not change with increasing N . 



III. MUI Analysis and the Proposed CE Precoder 

For any given information symbol vector u to be communicated, with 6 as the transmitted phase angle 
vector, using ([T]) the received signal at the A;-th user can be expressed as 

^ /V^ hkiC^^' \ 

Vk= y PryEkUk + V PrSk + Wk , Sk = [ — '"^ r—' V ^kUk (2) 

V VN / 

where y/Pr Sk is the MUI term at the k-th user. In this section, for any general CE precoding scheme 
where the signal transmitted from each BS antenna has constant envelope, through analysis, we aim to 
get a better understanding of the MUI energy level at each user. Towards this end, we firstly study the 
range of values taken by the noise-free received signal at the users (scaled down by y/Pr)- This range of 

' Note that CE transmission is entirely different from equal gain transmission (EGT). We explain this difference for the simple single-user 
scenario (Af = 1). In EGT a unit average energy complex information symbol u is communicated to the user by transmitting Xi = WiU from 
the j-th transmit antenna (with \wi \ = • • ■ = |uijv| = ^/Pr/N), and therefore the amplitude of the signal transmitted from each antenna is 
not constant but varies with the amplitude of u {\xi\ = Pt /N\u\). In contrast, the CE precoding method proposed in this paper (Section 
IIII-Bt transmits a constant amplitude signal from each antenna (i.e., Pt /Ne-''^^ from the i-th antenna), where the transmit phase angles 
^1, • • • ,6n are chosen in such a way that the noise-free received signal is a known constant times the desired information symbol u. 
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values is given by the set 

M{H) ^ L = {v,,--- , vm) eC^ \ Vk= ^'=' , e, e [-1T, tt) , z = 1, ■ ■ ■ , iv| (3) 

For any vector v = {vi,V2, - ■ ■ , vm)'^ ^ A^(H), from © it follows that there exists a 9^ = {9'^, ■ ■ ■ , 9'^)'^ 

such that Vk = , = 1, 2, . . . , M. This sum can now be expressed as a sum of N/M terms 

(without loss of generality let us assume that N/M is integral only for the argument presented here) 

N/M qM 
g=l r=(g-l)M+l 

From dll) it immediately follows that A^(H) can be expressed as a direct-sum of N/M sets, i.e. 

M (H(^^)) 4 {v = (^1, . . . , z;,,) e C^^ I V, = ^^=1 e ^ ^ , g = 1, . . . , iV/M 

(5) 

where H*^"?) is the sub-matrix of H containing only the columns numbered {q — 1)M + 1, (g — 1)M + 
2,-- - ,qM. A1(H('?)) C C^^ is the dynamic range of the received noise-free signals when only the 
M BS antennas numbered {q — 1)M + 1, (g — 1)M + 2, ■ ■ ■ , qM are used and the remaining N — M 
antennas are inactive. If the statistical distribution of the channel gain vector from a BS antenna to 
all the users is identical for all the BS antennas (as in i.i.d. channels), then, on an average the sets 
Ai (H*^'')) , g = 1, . . . , N/M would all have similar topological properties. Since, A^(H) is a direct-sum 
of N/M topologically similar sets, it is expected that for a fixed M, on an average the region A^(H) 
expands with increasing N. Specifically, for a fixed M and increasing N, the maximum Euclidean length 
of any vector in A^(H) grows as 0{\/N), since A^(H) is a direct-sum of 0{N) topologically similar 
sets (A^(H^'^^) , g = 1, 2, . . . , N/M) with the maximum Euclidean length of any vector in Ai (H*^'^^) being 
0{1/\/N) (note that in the definition of A^(H^'^)) in ([5]), each component of any vector v e A^(H(^)) 
is scaled down by V^). Also, for a fixed M and increasing A^, since A^(H) is a direct-sum of N/M 
similar sets, it is expected that the set A^(H) becomes increasingly dense (i.e., the number of elements of 
A^(H) in a fixed volume in C^''^ is expected to increase with increasing A^). The above discussion leads 
us to the following results in Section IIII-AI and IIII-CI 
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A. Diminishing MUI with increasing N, for fixed M and fixed Ek{k = 1, . . . , M) 

For a fixed M and fixed Ek, the information alphabets and the information symbol energies are fixed. 
However, since increasing (with fixed M) is expected to enlarge the set (H) and make it increasingly 
denser, it is highly probable that at sufficiently large N, for any fixed information symbol vector u = 
{^/ElUl, ■ ■ ■ , \/EmUm)'^ £ W there exists a vector v G A^(H) such that v is very close to u in terms of 
Euclidean distance. This then implies that, with increasing N and fixed M, for any u eU there exists a 
transmit phase angle vector such that the sum of the MUI energy for all users is small compared to the 
AWGN variance at the receiver. Hence, for a fixed M and fixed Ek, it is expected that the MUI energy 
for each user decreases with increasing A^. 

This is in fact true, as we prove it formally for channels satisfying the following mild conditions. 
Specifically for a fixed M, we consider a sequence of channel gain matrices {iiN}N=M satisfying 

lim ' ^ ^ ' ' = , Wkj^l , k,l e (1,...,M) (As.l) 

l/i^^^l l/i^^^l l/i^^^l l/i^^^l 
lim ^^=^' = 0, VA;i,/i,A;2,/2G(l,2,...,M)(As.2) 

N^oa iV 

lim " '/^ = Cfc , fc = l,2,...,M(As.3) (6) 

where are positive constants, h^^-* denotes the fc-th row of H^v and h\^J denotes the i-th component of 
h^^-*. From the law of large numbers, it follows that i.i.d. channels satisfy these conditions with probability 
one lfT3l . Physical measurements of the channel characteristics with large antenna arrays at the BS have 
revealed closeness to the i.i.d. fading model, as long as the BS antennas are sufficiently spaced apart 
(usually half of the carrier wavelength) [[T4|. [[U. 

Theorem 1: For a fixed M and increasing N, consider a sequence of channel gain matrices {Hn}']^^]^ 
satisfying the mild conditions in Q. For any given fixed finite alphabet U (fixed Ek, k = 1, . . . , M) and 
any given A > 0, there exist a corresponding integer N {{Ylf^} ,U , A) such that with N > N({iij^},U, A) 
and Hjv as the channel gain matrix, for any u G W to be communicated, there exist a phase angle vector 
Q'%{A) = (6'"(A), ■ ■ ■ ,6']^(A))^ which when transmitted, results in the MUI energy at each user being 
upper bounded by 2A^, i.e. 



N 



EkUk 



^ <2A'^ , k = l,...,M. (7) 
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Proof - The proof relies on technical results stated and proved in Appendix lAl and iBl All these results 

assume a fixed M (number of user terminals) and increasing N (number of BS antennas). These results 

are stated for a fixed sequence of channel matrices {HatI'^^^j, fixed information alphabets Ui, ■ ■ ■ ,Um 

and fixed information symbol energy Ei, ■ ■ ■ , Em. Further, the sequence of channel matrices {H.n}n=m 

is assumed to satisfy the conditions in Q and the information alphabets are assumed to be finite/discrete. 

The proofs use a novel probabilistic approach, treating the transmitted phase angles as random variables. 

We now present the proof of Theorem [B 

Let us consider a probability space with the transmitted phase angles 9i,i = 1,2,...,N being i.i.d. 
r.v's uniformly distributed in [— vr , tt). For a given sequence of channel matrices {Hat}, we define a 
corresponding sequence of r.v's {z^v}, with = {zi^\ \ ■ ■ ■ , z^m'^\ z^l ^) G M^*^, where we have 

z, = Re( ^= =Im( -= j,fc=l,...,M. (8) 



From Theorem [2] in Appendix |A] it follows that, for any channel sequence {Hat} satisfying the 
conditions in Q, as iV — )■ oo (with fixed A/), the corresponding sequence of r.v's {z^} converges 
in distribution to a 2A/-dimensional real Gaussian random vector X = {X[,X^, ■ ■ ■ ,X^j,X^)-^ with 
independent zero-mean components and var(X^) = var(X^) = Cfc/2, k = 1,2,...,M. For a given 



u = {^/E^Ul, ■ ■ ■ , yJEMUMY ^ A > 0, wc next consider the box 



where = Re(Mfc) , = Im(Mfc). The box B^iu) contains all those vectors in M?^^ whose component- 
wise displacement from u is upper bounded by A. Using the fact that z^ converges in distribution to a 
Gaussian r.v. with M^^^ as its range space, in Theorem [3] (Appendix |B]) it is shown that, for any A > 0, 
there exist an integer A^({HAr}, W, A), such that for all > A^({HAr}, W, A) 

Prob(zAr e S^(u)) > , VueW. (10) 

Since the probability that z^v lies in the box B^{\i) is strictly positive for all u E U, from the definitions 
of B^{u) in dll) and z^ in dS]) it follows that, for any u eU there exist a phase angle vector 0^(A) = 
(^i'(A),--- ,e'^{A)f such that 



Re( ^= )-VEkU, 

for all /c = 1, 2, ■ ■ ■ , M, which then implies 



Im 



<A (11) 



SUBMITTED TO THE IEEE TRANSACTIONS ON COMMUNICATIONS 8 

Since Theorem [T] is valid for any A > and ([7]) holds for all > A^({Hjv}, W, A), we can satisfy 
(|7]) for any arbitrarily small A > by having N > N({ii]\f},U, A) i.e., a sufficiently large N. Hence, 
the MUI energy at each user can be guaranteed to be arbitrarily small by having a sufficiently large N . 
Theorem 1 therefore motivates us to propose precoding techniques which can achieve small MUI energy 
levels as guaranteed by the theorem. 



B. Proposed CE Precoding Scheme 

For reliable communication to each user, the precoder at the BS must choose a 9 such that the MUI 
energy \sk\ is as small as possible for each k = 1, 2, ... , M. This motivates us to consider the following 
non-linear least squares (NLS) problem, which for a given u to be communicated, finds the transmit phase 
angles that minimize the sum of the MUI energy for all users: 

9" = {ei,--- ,ei) = ^Tg min (7(9, u) 

eiG[-7r,7r) , j=l,...,Af 

= 2^\sk\ = y= VEkUk ■ (12) 

k=l ' ' k=l ' V ^ 

This NLS problem is non-convex and has multiple local minima. However, as the ratio N/M becomes 
large, due to the large number of extra degrees of freedom (N — M), the value of the objective function 
g{Q, u) at most local minima has been observed to be small, enabling gradient descent based methods to 
be usedO However, due to the slow convergence of gradient descent based methods, we propose a novel 
iterative method, which has been experimentally observed to achieve similar performance as the gradient 
descent based methods, but with a significantly faster convergence. 

In the proposed iterative method to solve (fT2)) . we start with the p = 0-th iteration, where we initialize 
all the angles to 0. Each iteration consists of sub-iterations. Let 9'^^''^^ = {6^'''\ ■ ■ ■ , denote the 

phase angle vector after the g-th sub-iteration (q = 1,2, . . . , N) of the p-th iteration (subsequently we shall 
refer to the q-th sub-iteration of the p-th iteration as the {p, g)-th iteration). After the (p, g)-th iteration, 
the algorithm moves either to the (p, q + l)-th iteration (if q < N), or else it moves to the (p + 1, l)-th 
iteration. In general, in the (p, q + l)-th iteration, the algorithm attempts to reduce the current value of 

^ This observation is expected, since the strict positivity of the box event probabihty in dlOt (proof of Theorem 1), implies that there are 
many distinct transmit phase angles Q such that the received noise-free vector lies in a small 2A/-dimensional cube (box) centered at the 
desired information symbol vector u, i.e., the MUI energy at each user is small for many different O. 
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the objective function i.e., g{Q^P''^\u) by only modifying the {q + l)-th phase angle (i.e., O'f^f) while 
keeping the other phase angles fixed to their values from the previous iteration. The new phase angles 
after the (p, q + l)-th iteration, are therefore given by 




g{e,u] 










h* 

"'k,q+l 


\(^ 







N 



/ A/ . N 

hk,i e-'^""'"' j - \/E'kUk 

1,7^(<?+1) 

^ ^f'^) , z = l,2,...,iV,z^g + l. (13) 

The algorithm is terminated after a pre-defined number of iterations ^ We denote the phase angle vector 
after the last iteration by §" = 0X,--- , ^]^)^. 

With 0" as the transmitted phase angle vector, the received signal and the MUI term are given by 

Vk = yPrwEkUk + yPrSk + Wfc , Sk = y -7= \/ EkUkj (14) 



The received signal-to-noise-and-interference-ratio (SINR) at the k-\h user is therefore given by 



where E = E2, ■ ■ ■ , Em^ is the vector of information symbol energies. Note that the above SINR 
expression is for a given channel realization H. For each user, we would be ideally interested to have a 
low value of the MUI energy E[|sfcP], since this would imply a larger SINR. 

To illustrate the result of Theorem 1, in Fig. [H for the i.i.d. CA/'(0, 1) Rayleigh fading channel, with 
fixed information alphabets Ui = U2 = ■ ■ ■ = Um = (16-QAM and Gaussian) and fixed information 
symbol energy Ek = l,k = 1,..., M, we plot the ergodic (averaged over channel statistics) MUI energy 
IEh[|sa:P] with the proposed CE precoding scheme (using the discussed iterative method for solving ([721) ) 



as a function of increasing (% is given by (1141)') J I It is observed that, for a fixed M, fixed information 
alphabets and fixed information symbol energy, the ergodic per-user MUI energy decreases with increasing 

^ Experimentally, we have observed that, for the i.i.d. Rayleigh fading channel, with a sufficiently large N/M ratio, beyond the p = L-th 

iteration (where L is some constant integer), the incremental reduction in the value of the objective function is minimal. Therefore, we 

terminate at the L-th iteration. Since there are totally LN sub-iterations, from the phase angle update equation in l ll3t . it follows that the 

complexity of the proposed iterative algorithm is 0{MN). 

' We have observed that Enil^fep] is the same for all = 1, . . . , M. 



SUBMITTED TO THE ffiEE TRANSACTIONS ON COMMUNICATIONS 10 

number of BS antennas A^. This is observed to be true, not only for a finite/discrete 16-QAM information 
symbol alphabet, but also for the non-discrete Gaussian information alphabet. 



C. Increasing with increasing N, for a fixed M, fixed Wi, ■ ■ ■ Mm and fixed desired MUI energy level 

It is clear that, for a fixed M and N, increasing Ek, k = 1, . . . , M would enlarge U which could then 
increase MUI energy level at each user (enlarging U might result in U ^ A^(H)). However, since an 
increase in (with fixed M and Ek) results in a reduction of MUI (Theorem [U, it can be argued that 
for a fixed M, with increasing the information symbol energy of each user (i.e., Ek , k = 1, ■ ■ ■ , M) 
can be increased while maintaining a fixed MUI energy level at each user. Further, from it is clear 
that for a fixed Pt the effective SINR at the A;-th user (i.e., -E'fe/(Eu[|sfcp] + a"^ / Pt)) will increase with 
increasing N, since Ek can be increased while maintaining a constant MUI energy. Finally, since a'^ / Pt 
increases with decreasing Pt and the MUI energy is independent of Pt, by appropriately decreasing 
Pt and increasing Ek with increasing N (fixed M), a constant SINR level can be maintained at each 
user. 

This observation is based entirely on Theorem 1 (which holds for a broad class of fading channels 
satisfying the conditions in Q, including i.i.d. fading channels). The above observation implies that 
as long as the channel satisfies the conditions in the total transmit power can be reduced without 
affecting user information rates, by using a sufficiently large antenna array at the BS with constant envelope 
transmission (i.e., an achievable array gain greater than one). We illustrate this through the following 
example using the proposed CE precoding scheme. Let the fixed desired ergodic MUI energy level for the 
fc-th user be denoted by 1^ , k = 1, 2, ■ ■ ■ , M. For the sake of simplicity we consider Ui=U2 = ■ ■ ■ = Um- 
Consider 

E* = max p (16) 

p>0 I Efc=p,EH[E„^,...,„^^ [|?fep]] =h , k=l,-,M 

which finds the highest possible equal energy of the information symbols under the constraint that the 
ergodic MUI energy level is fixed at , = 1, 2, ■ ■ ■ , M. In (fT6l) . % is given by (fT4)) . In Fig. [2l for the 
i.i.d. Rayleigh fading channel, for a fixed M = 12 and a fixed Ui = ■ ■ ■ = Um = (16-QAM and Gaussian), 
we plot E* as a function of increasing N, for two different fixed desired MUI energy levels, Ik = 0.1 
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. From Fig. [21 it can be observed that for a fixed M and fixed 
Ui, - ■ ■ ,Um, E* increases linearly with increasing N, while still maintaining a fixed MUI energy level at 
each user. At low MUI energy levels, from (fISl) it follows that 7^ ^ PrEk/o-'^. Since Ek (k = 1,2, ■ ■ ■ , M) 
can be increased linearly with (while still maintaining a low MUI level), it can be argued that a desired 
fixed SINR level can be maintained at each user by simply reducing Pt linearly with increasing N . This 
suggests the achievability of an 0{N) array power gain for the i.i.d. Rayleigh fading channel. In the next 
section we derive an achievable sum-rate for the proposed CE precoding scheme, using which (in Section 
rvT) . for an i.i.d. Rayleigh fading channel, through simulations we show that indeed an 0{N) array power 
gain can be achieved. 



and Ji. = 0.01 (same K for each user 
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IV. Achievable information sum rate 



In this section we study the ergodic information sum-rate achieved by the CE precodingscheme proposed 
in Section IIII-BI For a given channel realization H, Gaussian information alphabetoQ Wi, - ■■ ,Um, 
information symbol energies Ei,--- ,Em and total transmit power to receiver noise ratio Pxjo'^, the 
mutual information between and is given by 

I{yk]Uk) = h{Uk) - h{Uk\yk) = h{Uk) - hfuk rrr Uk] > h{Uk) - h(uk j^^j=) (17) 

V \ P^\ hii. ' ^ \ P^\ hii.' 



Due to same channel gain distribution and information alplrabet for eacli user, it is observed that the ergodic MUI energy level at each 

user is also same if the users have equal information symbol energy. 

"We restrict the discussion to Gaussian information alphabets, due to the difficulty in analyzing the information rate achieved with discrete 

alphabets. This is not a concern since, through Figs. [T] and |2l we have already observed that the two important results in Section IlII-AI and 

Illl-Cl hold true for Gaussian alphabets as well. 

'^We would also like to mention here that Gaussian information alphabets need not be optimal w.r.t. achieving the maximum sum-rate of a 

per-antenna CE constrained GBC. As an example, in 1161 , we have considered the capacity of a single-user MISO channel with per-antenna 

CE constraints at the transmitter. Due to the scenario in 1161 being simpler compared to the multi-user scenario discussed here, in 1161 

we were able to show that the optimal capacity achieving complex alphabet is discrete-in-amplitude and uniform-in-phase (DAUIP) (i.e., 

non-Gaussian). However, since it appears that the analytical tools and techniques in II16I cannot be used to derive the optimal alphabet for 

the multiuser scenario, we restrict ourselves to Gaussian alphabets here. 
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where h{z) denotes the differential entropy of a continuous valued r.v. z. The inequality in (fTVl ) follows 
from the fact that conditioning of a r.v. reduces its entropy. Further, using (fT4)) in (fTTI) we have 



I{yk\Uk) > h{uk)-h 



Sk 

J Eh 



+ 



> log2(vre) - log2 l^vrevar 

> log2(7re) - log2 vreE 



PrVEk 

'Sk 



log2(7re) - h 



Sk 

/El. 



Wk 



/Ek 

'Sk 



+ 



Wk 



log2(7re) -log2 I ire 



jEk 
Ek 



+ 



^xfEk- 

Wk 

\fP^xfE'k 



PrEk 



log2(7fc(H,E,^) 



(18) 



where Rk (ii, E, j = log2 ^7a;(H, E, ■^) j is an achievable information rate for the A;-th user, with the 
proposed CE precoding scheme. In (fTSi) . we have used the fact that the differential entropy of a complex 
Gaussian circular symmetric r.v. z having variance is log2(7recr^). Further, for any complex scalar r.v. 
z, \ar[z] = K[\z — E[2;]p]. The second inequality in (fTSi) follows from the fact that, for a complex scalar 
r.v., among all possible probability distributions having the same variance, the complex circular symmetric 
Gaussian distribution is the entropy maximizer [9]. The third inequality follows from the fact that, for 
any complex scalar r.v. z, var[z] < E[|2;p]. From (fTSi) it follows that an achievable ergodic information 
sum-rate for the GBC under the per-antenna CE constraint, is given by 

M 



k=l 



(19) 



Subsequently, we consider the scenario where all users have the same unit energy Gaussian information 
alphabet (i.e., Ui = ■ ■ ■ = Um) and the same information symbol energy (i.e., Ei = E2 = ■ ■ ■ = -Em)Q 
Further optimization of R^^ (^E, j over E subject to i^i = ■ ■ ■ = Em, results in an achievable ergodic 
information sum-rate which is given by 



max 

E\Et_=E2=---=Em>0 



(20) 



" We impose this constraint so as to reduce the number of parameters involved, thereby simplifying the study of achievable rates in 
a multi-user GBC with per-antenna CE transmission. Nevertheless, for the i.i.d. Rayleigh fading channel with each user having the same 
Gaussian information alphabet, it is expected that the optimal E which maximizes the ergodic sum-rate in l |19t , has equal components. 
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Since it is difficult to analyze the sum-rate expression in (|20l) . we have studied it through exhaustive 
numerical simulations for an i.i.d. CJ\f{0, 1) Rayleigh fading channel. In the following section, we present 
some important observations based on these numerical experiments. 

V. Simulation results on the achievable ergodic information sum-rate R^^(^^^ 

All reported results are for the i.i.d. CJ\f{0, 1) Rayleigh fading channel. In Fig. [3l for a fixed M we plot 
the minimum Pt/ct^ required by the proposed CE precoder, to achieve an ergodic per-user information 
rate of R^^{Pt / cr'^) / M = 2 bits-per-channel-use (bpcu) as a function of increasing N (Due to the same 
channel distribution for each user, we have observed that the ergodic information rate achieved by each 
user is 1/M of the ergodic sum-rate). The minimum required Pt/ct'^ is also tabulated in Table HI It is 
observed that, for a fixed M, at sufficiently large N, the required Pr/cr'^ reduces by roughly 3 dB for 
every doubling in (i.e., the required Prjfy'^ reduces linearly with increasing N). This shows that, for 
a fixed M, an array power gain of 0[N) can indeed be achieved even under the stringent per-antenna 
CE constraint. For the sake of comparison, we have also plotted a lower bound on the Pt/ct^ required 
to achieve a per-user ergodic rate of 2 bpcu under the APC constraint (we have used the cooperative 



upper bound on the GBC sum-capacity [iTOlD l I We observe that, for large N and a fixed per-user desired 
ergodic information rate of 2 bpcu, compared to the APC only constrained GBC, the extra total transmit 
power (power gap) required under the more stringent per-antenna CE constraint is small (1.7 dB). 

In Fig. [3l we also consider another CE precoding scheme, where, for a given information symbol 
vector u, the precoder firstly computes the zero-forcing (ZF) vector x = H^u, (H^ = ^HH^ j is 
the pseudo-inverse of H). Prior to transmission, each component of x is normalized to have a modulus 



equal to ^,/Pt/N, i.e., the signal transmitted from tie i-th BS antenna is Pt/N Xi/\xi\. At each user. 



the received signal is scaled by a fixed constantllf 



We shall hence-forth refer to this precoder as the 



ZF phase-only precoder. In Fig. [3l we observe that the Pt/c'^ required by the proposed CE precoder is 
always less than that required by the ZF phase-only precoder. In fact, for moderate values of N/M, the 
proposed CE precoder requires significantly less Pt/u^ as compared to the ZF phase-only precoder (e.g. 
with = 100, M = 40, the required Pt/c'^ with the proposed CE precoder is roughly 3 dB less than that 

The cooperative upper bound on the GBC sum capacity gives a lower bound on the Pt jo^ required by a GBC sum-capacity achieving 

scheme to achieve a given desired ergodic information sum-rate. 

'^This constant is chosen in such a way that the ergodic per-user information rate is maximized. It is therefore fixed for all channel 

realizations and depends only upon the statistics of the channel, Pt/o'^, N and M. 
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required with the ZF phase-only precoder). However, at very 



arge values of N/M, the ZF phase-only 







precoder has similar performance as the proposed CE precoder. 

To gain a better understanding of the power-efficiency of the considered CE precoders, in Fig. IH for 
a fixed = 48, M = 12 we plot an upper bound on the extra Pr/cr^ required by the considered CE 
precoding schemes when compared to a GBC sum-capacity achieving scheme under APcllzl as a function 
of the desired per-user ergodic information rate (note that in Fig. [3l the desired per-user rate was fixed to 2 
bpcu). It is observed that, for a desired ergodic per-user information rate below 2 bpcu, the ZF phase-only 
precoder requires roughly 1 — 1.5 dB more transmit power as compared to the proposed CE precoder. For 
rates higher than 2 bpcu, this gap increases very rapidly (at 3 bpcu, this power gap is roughly 6 dB). In 
Fig. [51 we plot the results of a similar experiment but with = 480, M = 12 (a very large ratio of N/M). 
It is observed that, the ZF phase-only precoder has similar performance as the proposed CE precoder for 
per-user ergodic information rates below 3 bpcu. For rates higher than 3 bpcu, the performance of the ZF 
phase-only precoder deteriorates rapidly, just as it did in Fig. HI In Figs. |4] and [51 we also note that the 
extra total transmit power required by the proposed CE precoder (Section IIII-BI) increases slowly w.r.t. 
increasing rate, and is less than 2.5 dB for a wide range of desired per-user information rates. From 
exhaustive experiments, we have concluded that, for moderate values of N/M, the proposed CE precoder 
is significantly more power efficient than the ZF phase-only precoder, whereas for very large N/M both 
precoders have similar performance when the desired per-user ergodic information rate is below a certain 
threshold (beyond this threshold, the performance of the ZF phase-only precoder deteriorates). 

In Fig. [3l for the proposed CE precoder, we had observed that for a fixed M and fixed desired per- 
user information rate, with "sufficiently large" N, the total transmit power can be reduced linearly with 
increasing A^. We next try to understand as to how "large" must be, so that Pt/ct"^ can be reduced by 
roughly 3 dB with every doubling in A^ (fixed M), while maintaining a fixed achievable per-user ergodic 

'^Note that the ZF phase-only precoder does not necessarily have a lower complexity than the proposed CE precoder. This is because, 

the ZF phase-only precoder needs to compute the pseudo-inverse of the channel gain matrix (a M x A'^ matrix) and also the matrix vector 

product of the pseudo-inverse times the information symbol vector u. Computing the pseudo-inverse has a complexity of 0{M^N) and 

that for the matrix vector product is 0{AIN), resulting in a total complexity of 0{M^N) for the ZF phase-only precoder. In contrast, the 

proposed CE precoder does not need to compute the pseudo-inverse, and has a complexity of 0{MN) (see Section Ull-Bt as compared to 

the 0{M'^ N) complexity of the ZF phase-only precoder. 

" Since we use the cooperative upper bound to predict the Pt /f^ required by a GBC sum-capacity achieving scheme, the reported values 

of the extra Pt/ct"^ required by the considered CE precoders are infact an upper bound on the minimum extra Pt/(t^ required. 
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information rate. In Fig. [6l for a fixed M = 12 users, we plot the achievable per-user ergodic information 
rate under per-antenna CE transmission (i.e., R^^ (^) Z^-* ^ function of increasing N and Pt = Pq/N 
(i.e., we linearly decrease Pt with increasing A^, Pq = 38.4). It is observed that, the per-user ergodic 
information rate increases and approaches a limiting information rate as — )• oo (shown by the dashed 
curve in the figure). Pq = 38.4 corresponds to a limiting per-user information rate of roughly 1.7 bpcu. 
This then suggests that, in the limit as — t- oo, the per-user information rate remains fixed as long as 
Pt is scaled down linearly with increasing (this re-confirms our conclusion on the achievability of 
an 0{N) array power gain under the per-antenna CE constraint, for a i.i.d. Rayleigh fading channel). A 
similar behaviour is observed under APC (see the GBC sum capacity upper bound curve in the figure). In 
Fig. |7l similar results have been illustrated for M = 24 users and Pt = Pi/N (Pi = 72.3, corresponding 
to a limiting per-user information rate of roughly 1.7 bpcu). With regards to the question on how "large" 
must A^ be, it is now clear that A^ must at least be so large that the achievable per-user ergodic information 
rate is sufficiently close to its limiting information rate (i.e., in the flat region of the curve). In general, 
for a desired closeneso to the limiting information rate, the minimum number of BS antennas required 
depends on M. Our numerical experiments suggest that, to achieve a fixed desired ratio of the per-user 
ergodic information rate to the limiting information rate, a channel with a large M requires a large A^ 
also. As an example, for a fixed ratio of 0.95 between the achievable per-user ergodic information rate 
and the limiting information rate, a channel with M = 12 users requires a BS with at least A^ = 96 
antennas, whereas a channel with M = 24 users requires a BS with at least A^ = 192 antennas (i.e., to 
achieve an ergodic per-user information rate within 95 percent of the limiting information rate requires a 
BS with roughly 8 times more number of antennas than the number of users). 

VI. Conclusion 

In this paper, we have considered per-antenna constant envelope (CE) transmission in the downlink 
of multi-user MIMO systems (GBC) employing a large number of BS antennas. Under certain mild 
conditions on the channel, even with a stringent per-antenna CE constraint, array power gain can still be 
achieved. We have also proposed a low-complexity CE precoding scheme. For the proposed CE precoding 
scheme, through exhaustive simulations for the i.i.d. Rayleigh fading channel, it is shown that, compared 

'^Closeness could be expressed in terms of the achievable per-user ergodic information rate being greater than a specified percentage of 
the limiting information rate. 
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to an APC only constrained GBC, the extra total transmit power required by the proposed CE precoder 
to achieve a given per-user ergodic information rate is small (usually less than 2 dB for scenarios of 
interest). Typically, a non-linear power-efficient amplifier is about 4 — 6 times more power-efficient than 
a highly linear amplifier [fTTI . Combining this fact with the fact that per-antenna CE signals require an 
extra 2 dB transmit power, we arrive at the conclusion that, for a given desired achievable information 
sum-rate, with sufficiently large N, a base station having power-efficient amplifiers with CE inputs would 
require 101ogio(4) — 2.0 = 4.0 dB lesser total transmit power compared to a base station having highly 
linear power- inefficient amplifiers with high PAPR inputs. 
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Appendix A 

Convergence (in distribution) of the sequence jzArj 

The convergence in distribution of the sequence of random variables |zAr| (as — oo with fixed M) 
is stated and proved in Theorem [2l Its proof relies on three known results which have been stated below. 

Result 1: (Multivariate Central Limit Theorem (CUT)) Let F„ denote the joint cumulative distribution 
function (c.d.f.) of the A;-dimensional real random variable {Xn \ ■ ■ ■ , xji'^), n = 1,2,... and for each real 

vector A = (Ai, A2, ■ ■ ■ , A^)^, let Fa„ be the c.d.f. of the random variable AiX^^^ + A2A'^^-' H \-XkXn^. 

A necessary and sufficient condition for Fn to converge to a limiting distribution (as n — 00) is that Fau 
converges to a limit for each vector A. 

Proof - For details please refer to [f T2| . ■ 

This result basically states that, if F is the joint c.d.f. of a fc-dimensional real random variable 
(X(i), A:(2), . . . , a:(*^)), and if Fa„ F^ foiH each vector A, then F„ ^ F as n ^ 00. 

Result 2: (Lyapunov-CLT) Let n = 1, 2, . . . be a sequence of independent real-valued scalar 

random variables. Let E[X„] = E[(X„ — UnY] = CTn^ and for some fixed ^ > 0, E[|X„ — = /3„ 

exists for all n. Furthermore let 

n 1 n 2_ 

Bn^iY.^^)"^ ^C^-iE-'Y- (21) 



i=l 1=1 

Then if lim ^ = 0, (22) 

the c.d.f. of Yn = ^ ' converges (in the limit as n — )• 00) to the c.d.f. of a real Gaussian random 
variable with mean zero and unit variance. 

Proof - For details please refer to [|T3l . ■ 
Result 3: (Slutsky's Theorem) Let {Xn} and {Yn} be a sequence of scalar random variables. If 
converges in distribution (as n — t- 00) to some random variable X, and {Yn} converges in probability to 
some constant c, then the product sequence converges in distribution to the random variable cX. 

Proof - For details please refer to ([TTl . ■ 
Theorem 2: For any channel sequence {H^v} satisfying the conditions in Q, the associated sequence 
of random vectors {z^v} (defined in ([8])) converges (as — 00 with fixed M) in distribution 
to a multivariate 2M-dimensional real Gaussian random vector X = {Xl,X^, - ■ ■ ,X|j,X^)^ with 

" Fa is the c.d.f. of AiX'^^ + • • • + XkX^''\ 
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1,2,. 



,M (note that 



independent zero-mean components and var(X^) = var(X^) = Cfc/2, /c 
Cfc , = 1, 2, . . . , M is defined in ©). 

Proof - Consider a multivariate 2M-dimensional real random variable {X[, X^, ■ ■ ■ , X^^^, X^.j), whose 
components are i.i.d. real Gaussian with mean zero and var(X^) = var(X^) = Cfc/2 , /c = 1, 2, . . . , M. 
Then, for any vector A = (A{, Af , ■ ■ ■ , Ajj^, A^^)^ G M^^^ the scalar random variable (A{X( + A^ + 

1" ^M^ii + -^m^m) ^^^^ Gaussian with mean zero and variance Ylik=i '^k{{^kf + (-^fc )^) /2- 

If we can show that for any arbitrary vector A G M^^^, the limiting distribution of z^A is also real 
Gaussian with mean zero and the same variance Xlfli Cfc ((A^)^ + (A^)^)/2, then using Result [Hit will 
follow that the c.d.f. of converges to the c.d.f. of {X{,X^ ^ ■ ■ ■ ,Xj[,^,X^) as X — t- oo. This would 
then complete the proof. 

In the following we show that under the assumptions stated in for any arbitrary vector A G R^^, 

indeed the limiting distribution (i.e., as X — t- oo with fixed M) of z^A is real Gaussian with mean zero 

and variance Yl!k=i(^k{{^kf + {^2)^)/'^^ thereby completing the proof. 
For a given 2M-dimensional real vector A = (Af , A^ , ■ ■ ■ , A^)^, let 

M 

Cat = z^A = 2^(Aj;Zj; +A^-^ 



-^k 



(23) 



fe=i 



From the above definition and ([8]), it follows that r.v. C^m can be expressed ac^ 

TV N 

^(a^ cos(6'j) + sin(6'j)) = ^ J af + bf cos( 



C 



N 



tan 



1 bi , 



A Yjk=li'^k^kA +^k^k,i ) 



where h 



k,i 



k,i 



N 

AN) 



A 



J2k'=ii^k K.t ' ~ K^k,i ) 



N 



lm{hf,- ). We further define 



A 



af + bf cos( 



tan — ) 



1,2, 



(24) 
(25) 

X are also 



Since, the phase angles 9i,i = 1,2, ... ,N are independent of each other, rji, i 

independent. Therefore, Civ is nothing but the sum of N independent random variables. We can therefore 

apply the Lyapunov-CLT (Result |2l) to study the convergence of the c.d.f. of (^j^ as N oo. 

We firstly see that fii = E[T]i] = and = E[r]f] = {af + bf)/2 since 9i is uniformly distributed 
in [— 7r,7r). We next show that the conditions of the Lyapunov-CLT ( (l22l) in Result O are satisfied with 
^ = 2. We see that 



l3,=E[r^f] = {af+bf)^E[cos\0,^t^n~^- 

ai ' 



2\2 



(26) 



"Note that the randomness in zjv is only due to the random variables 6i , i = 1,2, 



,N. 
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exists for all i. In order that the condition in (|22l) is satisfied, we must show that 



lim 5^ = (27) 

-/V— >oo Gat 



where 



N 1 „ N 1 AT 1 AT 



2—1 i— 1 i— 1 



As a note, from (1241) it follows that both Bj^ and Cat are strictly positive for all > M. Since M is 
fixed, proving (|271) is therefore equivalent to proving that 



lim -f (29) 



Using ([61) we firstly show that 



(30) 



i.e., (7^ converges to a constant as — t- oo. We then show that, again under 



N 



lim B% = (31) 



■>oo 



Equation dlH) would then follow from dSO]) and (HB- We next show dSO]). Using dlH]) we have 2C[ 



N 



J2iLi{(^'i + bf). Expanding the expressions for and bi in J^iLii^^l + ^j^) using (|24)) . we have 

M llu(^)||2 M M v-Af Z^/'^'^/'"' I /,Q*"S 

+ (A^AJ5 _ AgAf) ^-^^'^-- V. ) |^ (32) 

From As.l and As. 3 in ^ it follows that 

lim — = , lim '■ -— '■ = , lim — = Cfc. (33) 

N^oo N N-^oo N N^oo N 

Using (|33l) in (|32l) and taking the limit as A^ — )■ cxd we get (|30l) (note that M is fixed). We now show (|3T1) . 
Before proceeding further, we define the complex numbers = (A^ + j^k)^ ^ — 1)2,..., M. Expanding 
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the expressions for and bi inside the summation in 5^ (see (|28] )) we have 



AT ' ^ ^ AT 

I, k=l k=l l=k+l 

N A/ |\ 1211, (^)|2 



N M I, |2|i,(^)|2 r M A/ A/ / ^pA^ I ?, (^) |2p . / l (^)* ^ 

1=1 fc=l J Lfcl = lfc2 = li2=fc2 + 1 \ 



AfcJ lm(Afc2A;J — 



+^Z^Z^ 1. 2^ <^ Re(Afc,A,jRe(Afc,A,J + 

fcl = l fc2 = l il=fcl+l i2=fc2 + l I 

Re(Afe^AiJIm(A^,^A,J + 

Im(Afc^A;JRe(A;,,^A,J + 

Im( Afe^ Ai, )Im(A^;^ A,, ) \ . (34) 

From (As.2) in ^ it follows that for all ki, /cg, ^2 e (1, 2, . . . , M) 

lim -p: — = , lim -pz '■ — = 

J^oo ^V^ = ' iToo = 0- ^^^^ 

Substituting (|35]) into (|34l ) and taking the limit, we have 

i^oos''^ = L(L — M — J ^^^^ 

I, ,;=i fc=i ) 

Further, 

( N AI |\, |2|L(Af)|2 A/ A/ / y-W | L(iV) ,2, , (W) ,2 \ 

I. 1=1 fe=l J fcl=lfe2 = l \ / 



From (As.2) in Q it follows that limAr_5.oo ( — ^ — ) = and therefore using this result in (l37l) 
and (l36l) we get (|3T1) . From (l30l) it follows that Cf^ converges to a positive constant as — t- oo. Hence 
we have now shown (|29l) , and therefore the Lyapunov-CLT conditions for the convergence of the c.d.f. 
of the random variable Ctv are indeed satisfied. 

Therefore invoking Result |2] (Lyapunov-CLT), it follows that the c.d.f. of Cn/Cn converges to the 
c.d.f. of a zero mean real Gaussian random variable with unit variance. Further, since Cn converges 



to the constant v/| Efcli Cfc((Ai)2 + (A^)2) (see dSO])), using Result [3] (Slutsky's Theorem) it follows 
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that the c.d.f. of (j^f converges to the c.d.f. of a zero mean real Gaussian random variable with variance 

Appendix B 



Probability of the Box Event jzjv e B^{u) 

Theorem 3: For a given channel sequence {aN}N=M satisfying Q and a given fixed finite alphabet 
set U, for any A > 0, there exist a corresponding integer N({ii]\f},U, A), such that for all N > 
N{{IIn},U,A) (with fixed M) 

Prob(z;v e -B^(u)) > , Vu G W. (38) 

where B^{u) is defined in ([9]). 

Proof - To prove this result, we use the following expansion for the probability of a box event for 
any general multivariate n-dimensional real r.v. X = (Xi, X2, ■ ■ ■ ,Xn). We consider the probability 
that X lies in a n-dimensional box centered at a = G M" and denoted by C(A, a) = 

{(xi, X2, ■ ■ ■ , Xn) G I ctfc — A < ajfc < ttfc + A , /c = 1, 2, . . . , n}. For notational convenience, we refer 
to ctfc + A and — A as the corresponding "upper" and "lower" limits for the A;-th coordinate. The 
probability that X lies in the box C(A, a) is given by 

n 

Prob(Xe C(A,a)) = ^(-l)*^Tfe(A, a) (39) 

k=0 

where Tfc(A,a) is the probability that the r.v. (Xi, X2, ■ ■ ■ , Xn) belongs to a sub-region of 
{(xi, ■ ■ ■ , Xn) G I X/ < ai + A , I = 1, 2, . . . , n}, where exactly k coordinates are less than their 
corresponding "lower" limit and the reroairiing n — k coordinates are less than their corresponding "upper" 
limit. Specifically, Tfc(A,a) is given b>cl 

n n n 

Tk{A,a) ^ J2 ■■■ J2 Prob(x, <a,-A Vre{ii,i2,--- , ^r<ar + A Vr^{^l,^2,••■ ,«fc}) (40) 

41 = 1*2=41 + 1 ik=ik-l + i 

Using the expansion in (|39l ). the probability of the box event jz^r G i3^(u)| can be expressed as 

Prob^ZAT e 6^(u)) = Prob((/B^wi - A) < z^'"^' < + A) , (v^u^ - A) < zf^^ < (/b^w^ + A) 

, fc = l,2,...,A/ 

2M 2M 2M 2M 

= E(-1)'E E E Prob(zW<v^^i,-A V/e{zi,Z2,---,«fc}, 

fc=0 n=l!2=ii+l 4fc=ifc-i+l 

^""^ <^/Eiul + Ayl^{^l,^2,■■■ ,^k}) (41) 



^' As an example, for n = 2, we have Prob(^ai - A < Xi < ai + A , 02 - A < X2 < 02 + A j = To(A,a) - Ti(A, a) + T2(A, a), 
where ro(A, a) = Prob(Xi < ai + A , X2 < 02 + A), r2(A, a) = Prob(Xi < qi - A , X2 < 02 - A), and ri(A, a) = Prob(Xi < 
ai + A , 2 < 02 - A) + Prob(Xi < ai - A , X2 < 02 + A). 
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where z[^^ is the Z-th component of zjv (i.e., z\^'^ = zfl'^ ' for even /, and 



= .f/;;)/, for odd /) 



and ui is the /-th component of the vector u^, m^, ■ ■ ■ , wj^^, u^jY . For notational convenience we 
define 

r'"'(fc,ii,i2,--- ,ifc,u,A) =Prob(4'^^ < /Eiii/- A e {^l,^2,••■ M , 4"^^ < /b^u; + A V/ ^ {zi, ^2, • ■ ■ 

1 < ii < 12 < • • • < ifc < 2A/ , < fc < 2A/. (42) 

Let Y = (Fi,F25'"" 5 ^2 A/) denote a multivariate 2Af-dimensional real Gaussian random variable 
with independent zero mean components and var(F2fc-i) = var(F2fc) = Ck/2,k = 1,2, ...,M. 
From Theorem [2] it follows that the c.d.f. of converges to the c.d.f. of Y in the limit as 
N ^ oo. This convergence in distribution implies that, for any given arbitrary 5 > 0, for each term 
T*'^' {k, ii, Z2, ■ ■ ■ ! ik, u. A), there exists a corresponding positive integer N{k, zi, ^2, ■ ■ ■ , ik, S, u. A) such 
that for all > N{k, ii, ^2, ■ " " ? 'ik, u. A) 



(43) 



T {k,ii,i2,--- ,ik,u,A) - Pmh[Yi<^/Eiui-Ayie{ii,i2,---,ik} 

Yi<y%ui + A\fl^{H,i2,---,ik}] < S. 

We then choose a positive integer (7({Hjv}, u. A, 6) given by 



g{{UN},u,A,S) 



max max N(kAi,i2, 

A:=0,1---- ,2J\/ l<n<j2<---<4fc<2J\f 



,ik,S, u, A) 



(44) 



Combining dH]), (gZ)) and (gS]), for all iV > fi'({Hiv}, u. A, 5) we have 



Prob(zAr e S^(u)) - Prob(Y G S^(u)) 

2M 2j\f 2M 2M ( 



fc=0 n=l i2=ii+l •jfe=ifc_i+l 



2M 2M 2J\/ 2Af 

^ EE E - E 

fe=0 ii=l 'i2=il + l 'ifc=ifc_i+l 



Yi < ^/Eiui + A yii{h,i2,--- ,«fe}j 
r'"'(fc,ii,i2, ■ ■ ■ ,ifc,u, A) - Prob(^Y; < ^/E'lUl - A Ml e {^1,^2, • • • , ifc} , 

Yi < VElui + A yi i {^l,^2,••• ,ife}^ 



^ y^2M y^2A/ y^2A/ y^2Af 



S = 22^^5, 



(45) 



Since the range space (support) of Y is the entire space M^^^, it follows that Prob(Y G i3^(u)) > (i.e., 
strictly positive) for any A > and all u E U. For the given information symbol vector u and A > 0, 
we choose a corresponding 5 given by 



A 1 Prob(Y £ gju)) 



(46) 
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From dH]) and (|46l) it now follows that, for all N > g[{liN}, u, A, 6{u, A)) we have 

J ProbfY e 6^(u)) 
Prob(zw e (u)) - Prob(Y e 6^ (u)) | < A) = ^ 2 ^^^^ 

which then implies that 

, ^, Prob(Y e 6. (u)) 
Prob(zAr e 6^(u)) > ^ 2 ^ ° ^^^^ 

i.e., Prob(z7v G i3^(u)) is strictly positive for > (^({Hjy}, u, A, 5(u, A)) . For a given channel sequence 
{Hat}, a finite W and A > 0, we define the integer 

iV({HAr},Z^,A) = max.g({HAr},u,A,5(u,A)). (49) 
Combining this definition with the result in (l48l) proves the theorem. ■ 
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TABLE I 

Minimum Prjo'^ (DB) required to achieve a per-user ergodic rate of 2 bpcu 





N=60 


N=80 


N=100 


N=120 


N=160 


N=200 


N=240 


N = 320 


N = 400 


GBC Sum Capacity Upper Bound (M = 10) 


-2.8 


-4.0 


-5.1 


-5.8 


-7.2 


-8.2 


-8.9 


-10.2 


-11.2 


Proposed CE Precoder (M = 10) 


-0.8 


-2.1 


-3.3 


-4.1 


-5.5 


-6.5 


-7.2 


-8.6 


-9.6 


Power Gap (M = 10) 


2.0 


1.9 


1.8 


1.7 


1.7 


1.7 


1.7 


1.6 


1.6 


GBC Sum Capacity Upper Bound (M = 40) 


3.8 


2.4 


1.3 


0.6 


-0.9 


-2.0 


-2.7 


-4.1 


-5.1 


Proposed CE Precoder (M = 40) 


9.2 


6.0 


4.1 


3.2 


1.4 


-0.1 


-0.9 


-2.3 


-3.5 


Power Gap (M = 40) 


5.4 


3.6 


2.8 


2.6 


2.3 


1.9 


1.8 


1.8 


1.6 
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Fig. 1. Reduction in the ergodic per-user MUI energy EH[|Sfcp] with increasing A'^. Fixed A/, fixed U\ = ■ ■ ■ = IAm = 16-QAM, Gaussian 
and fixed Ek ^ I , k = 1,2, . . . , M. IID CU{Q, 1) Rayleigh fading. 
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Fig. 2. E* vs. A'^ for a fixed desired ergodic MUI energy level Ik (same for each user). Fixed M — 12, fixed Ui 
16-QAM, Gaussian. IID CAf(0, 1) Rayleigh fading. 
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Fig. 3. Required Pt/ct^ vs. A'^, to achieve a fixed desired ergodic per-user rate = 2 bpcu. Gaussian information alphabets Wi — 
IID C7V(0, 1) Rayleigh fading. 
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<>- ZF Phase-only Precoder 
-B — Proposed CE Precoder 
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Fig. 4. The extra Pt/ct^ (in dB) required (vertical axis) by thie proposed CE precoder and by the ZF phase-only precoder, respectively, 
to achieve the same ergodic per-user information rate as predicted by the GBC sum-capacity cooperative upper bound (horizontal axis). 
Here the number of base station antennas is TV = 48 and the number of users is M = 12. All users use Gaussian information alphabets 
Ui = ■ ■ ■ = Um — Gaussian and all channels are i.i.d. CJ\f{0, 1) Rayleigh fading. 
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Fig. 5. Same as Fig. |4] but for = 480 base station antennas. 
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96 BS antennas required to achieve an 
information rate which is 95% of the limit. 
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Information rate limit 
for the proposed CE precoder 



= 38.4 / N 
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Fig. 6. Ergodic per-user information rate for a fixed M = 12, witli tlie total transmit power scaled down linearly with increasing A^. 
Gaussian information alphabets Wi = ■ ■ ■ = Um. IID CM{Q, 1) Rayleigh fading. 
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192 BS antennas required to achieve an 
information rate within 95% of the limit. 
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Fig. 7. Same as Fig. |6] but with a fixed M = 24 and Pt = 72.3/ N. 



